On a robust F0 estimation of speech based on IRAPT using robust TV-CAR analysis
Authors
Abstract
Fundamental frequency (F0) estimation is important in speech processing applications such as speech coding, synthesis, and recognition. Existing F0 estimation methods perform well under clean conditions, but their performance deteriorates significantly in noisy environments. F0 estimation that is robust against additive noise is therefore in demand. We have previously proposed F0 estimation methods based on Time-Varying Complex AR (TV-CAR) analysis, whose criteria include the weighted correlation of the complex residual obtained by TV-CAR analysis and the sum of the harmonics of the complex residual spectrum, among others. Separately, E. Azarov et al. have proposed IRAPT (Instantaneous RAPT), an improved version of RAPT (Robust Algorithm for Pitch Tracking) that uses instantaneous harmonics. IRAPT can estimate F0 more accurately than RAPT. Since IRAPT uses a band-limited analytic signal to obtain harmonic frequencies, the complex residual signal obtained by TV-CAR analysis can also be applied to IRAPT. In this paper, a novel F0 estimation method that uses the instantaneous frequency based on the robust ELS (Extended Least Square) TV-CAR residual is proposed and evaluated.
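The idea of reading a frequency off a complex-valued signal can be illustrated with a small sketch. This is not the IRAPT algorithm or the TV-CAR analysis from the paper; it only shows, under simplifying assumptions, how an instantaneous frequency can be estimated from the phase increment between consecutive samples of a complex signal. The function name `instantaneous_frequency`, the sampling rate, and the synthetic 120 Hz test signal are all hypothetical choices made for illustration.

```python
import cmath
import math


def instantaneous_frequency(z, fs):
    """Estimate the instantaneous frequency (Hz) of a complex signal.

    Uses the phase increment between consecutive samples:
        f[n] = fs / (2*pi) * arg(z[n+1] * conj(z[n]))
    The result has one element fewer than the input.
    """
    return [
        fs / (2.0 * math.pi) * cmath.phase(z[n + 1] * z[n].conjugate())
        for n in range(len(z) - 1)
    ]


# Hypothetical test signal: a single complex exponential at 120 Hz,
# standing in for one band-limited harmonic of an analytic signal.
fs = 8000.0   # assumed sampling rate in Hz
f0 = 120.0    # assumed fundamental frequency in Hz
z = [cmath.exp(2j * math.pi * f0 * n / fs) for n in range(400)]

freqs = instantaneous_frequency(z, fs)
mean_f = sum(freqs) / len(freqs)
print(round(mean_f, 3))
```

For a real speech signal, the complex input would first have to be obtained, for example as a band-limited analytic signal or, as the abstract proposes, as the complex residual of the TV-CAR analysis. The phase-difference estimate is only meaningful when the band contains a single dominant component.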
Related articles
F0 estimation using SRH based on TV-CAR speech analysis
This paper proposes a novel robust speech F0 estimation method using SRH (Summation of Residual Harmonics) based on TV-CAR (Time-Varying Complex AR) analysis. We have already proposed robust F0 estimation based on the TV-CAR analysis in which the weighted autocorrelation of the complex residual signal is used as the criterion. In the SRH method, the criterion is calculated from LP residual signals. The criterio...
Improvement of the ELS-based time-varying complex speech analysis
We have already proposed novel robust parameter estimation algorithms for the time-varying complex AR (TV-CAR) model of analytic speech signals, based on GLS (Generalized Least Square) and its modification, ELS (Extended Least Square). The ELS method is a more sophisticated method that can be derived using the matrix inversion theorem. We have shown that the methods can achieve robust speech spec...
A time-varying complex AR speech analysis based on GLS and ELS method
We have already developed three kinds of time-varying complex AR (TV-CAR) parameter estimation algorithms for analytic speech signals, which are based on minimum mean square error (MMSE), Huber's robust M-estimation, and the Instrumental Variable (IV) method. This paper presents novel robust TV-CAR model parameter estimation algorithms on the basis of a Generalized Least Square (GLS) and Extended L...
F0 estimation based on robust ELS complex speech analysis
A robust fundamental frequency (F0) estimation algorithm based on robust ELS (Extended Least Square) complex-valued speech analysis for an analytic speech signal is proposed in this paper. The speech spectrum can be accurately estimated at low frequencies since the analytic signal provides a spectrum only over positive frequencies. This remarkable feature makes it possible to realize more accurate F0 e...